CDS

Accession Number TCMCG075C06023
gbkey CDS
Protein Id XP_017971003.1
Location complement(join(7291060..7291062,7291120..7291532,7291890..7292059,7292170..7292484,7292743..7292867,7294158..7294493))
Gene LOC18608310
GeneID 18608310
Organism Theobroma cacao

Protein

Length 453 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018115514.1
Definition PREDICTED: 5'-3' exoribonuclease isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description hydrolase activity, acting on ester bonds
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00188, R11188
KEGG_rclass RC00078
BRITE ko00000, ko01000
KEGG_ko ko:K07053
EC 3.1.3.97
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGTTGGGCTTGGCGATAGCAATTCCAGTTGCAATAATAACAACAAAGCAAAAGACAAGAAGAGGAAGAAGAAGAAGAAGCGTGGGGGTAGCAAGCGGAAGATGACCGCTGAGCAAACTCTGGCTTTCAAGTCGGTGACTGAATGGGTTTATTTGGATCACCAGAATTCTTCTTCCACAGCGGCCTTATCTTCGTGGGTGGTGGATGATTTTGGGGTGCAGAAGAGTCTGGGAAGAGGCATGGAGAAAGTGGTGTTTGAATTGCATTCCCATTCTAAACATAGTGATGGGTTCTTGTCCCCTTCTAAGCTCGTTGAGAGAGCTCATGGCAATGGGGTGAAAGTTCTTGCTCTGACAGATCATGATACAATGTCTGGGATCCCTGAGGCCATAGAAACAGCTCGCAGATTTGGTATCAAGATAATCCCTGGTGTTGAGATCAGCACAATATTCTCACCAAGCAGGAACTCTGAAATGGAGGAACCAGTGCACATCCTTGCATATTACAGCAGCTGTGGCCCAACAAGGTATGAGGAGCTGGAAACGTTCTTGGCTAACATAAGGGATGGACGTTATCTCCGTGCAAAGGACATGGTTTTGAAACTCAATAAACTCAAGCTACCTCTTAAGTGGGAGCATGTTACTAAGATTGCAGGCAAGGGAGTGGCTCCTGGGAGACTGCATGTAGCTCGAGCTATGGTTGAAGCAGGTTATGTAGAGAATCTAAAACAAGCTTTTGCCAGATATTTATATGATGGTGGACCTGCTTATTCCACGGGAAGCGAGCCTCTTGCAGAAGAAGCAGTGCAGCTTATATGTGAAACAGGGGGTTTAGCAGTGCTGGCTCATCCCTGGGCACTAAAGAATCCTATTCCCATCATAAGAAGGTTAAAAGATGCAGGGCTTCATGGAATGGAGGTTTACAGAAGTGATGGAAGATTGGCAGCGTACAGTGACTTGGCAGATACTTATGACCTTTTGAAGCTTGGAGGAGCAGATTATCATGGGAGAGGTGGGCATGGTGAGTCTGAACTAGGAAGTGTGAACCTTCCAGTGTTGGTTTTGCATGACTTTCTTAAGGTAGCTCGACCTATCTGGTGTGGTGCCATTAAGGACATTTTAGAGACTTATGCAGAGGAACCCTCTGATTCAAATCTAGCCAGGATTGCAAGATTTGGGAGGATGGGCAGTTTCAGGGGAAGTTCTCCCTTGAGTTGTGGCCAGGACTTTATTGATTGTTGTTTATCATCCTGGTTGACTACCGAAGAAAGGCAGAATGCTGAGTTTGAGGCTATTAGATTGAAGCTTTCCTATATTTCAATCGATCTGGGTGAAGTACAAGCTCCTATAGGGAGTAAATGA
Protein:  
MVGLGDSNSSCNNNNKAKDKKRKKKKKRGGSKRKMTAEQTLAFKSVTEWVYLDHQNSSSTAALSSWVVDDFGVQKSLGRGMEKVVFELHSHSKHSDGFLSPSKLVERAHGNGVKVLALTDHDTMSGIPEAIETARRFGIKIIPGVEISTIFSPSRNSEMEEPVHILAYYSSCGPTRYEELETFLANIRDGRYLRAKDMVLKLNKLKLPLKWEHVTKIAGKGVAPGRLHVARAMVEAGYVENLKQAFARYLYDGGPAYSTGSEPLAEEAVQLICETGGLAVLAHPWALKNPIPIIRRLKDAGLHGMEVYRSDGRLAAYSDLADTYDLLKLGGADYHGRGGHGESELGSVNLPVLVLHDFLKVARPIWCGAIKDILETYAEEPSDSNLARIARFGRMGSFRGSSPLSCGQDFIDCCLSSWLTTEERQNAEFEAIRLKLSYISIDLGEVQAPIGSK
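
Consistency check

The CDS Location and the protein Length fields can be cross-checked against each other. The following is a minimal Python sketch and is not part of the original record. It assumes the coordinates are 1-based and inclusive (the usual GenBank feature-table convention) and that the final codon of the CDS is a stop codon. The names LOCATION and segment_lengths are illustrative only.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(7291060..7291062,7291120..7291532,7291890..7292059,"
    "7292170..7292484,7292743..7292867,7294158..7294493))"
)


def segment_lengths(location: str) -> list[int]:
    """Return the length of each start..end segment in a location string.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    return [
        int(end) - int(start) + 1
        for start, end in re.findall(r"(\d+)\.\.(\d+)", location)
    ]


lengths = segment_lengths(LOCATION)
cds_nt = sum(lengths)        # total coding nucleotides across all segments
codons = cds_nt // 3         # includes the terminal stop codon
protein_aa = codons - 1      # assumption: the last codon is a stop codon

print(lengths)               # [3, 413, 170, 315, 125, 336]
print(cds_nt, protein_aa)    # 1362 453
```

The six segments sum to 1362 nt, or 454 codons. Subtracting the stop codon gives 453 residues, which agrees with the Length field of the Protein section.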